Efficient Dynamic Multiple GPGPU Layer for OpenCV
نویسنده
چکیده
General purpose graphic processing unit (GPGPU) provides high performance resource for computing. CUDA (Compute Unified Device Architecture) and OpenCL (Open Computing Language) permit writing of parallel computing programs that utilize multiple central processing units (CPU) and GPGPUs. The image processing library, OpenCV (Open Source Computer Vision library), may benefit greatly from parallel use of multiple GPGPUs, however, its CUDA implementation is restricted to benefiting from a single GPGPU only. This research develops an abstraction layer above OpenCV single GPU module that enables multiple GPUs for single instruction multiple data (SIMD) architecture. This approach has a controller/parent thread which generates various worker threads to operate on several GPU devices, to handle balancing of work load on GPUs, as the task allocation is dynamic for any number of GPUs. The experiments on running bilateral filtering, color to gray conversion, fast Fourier transform, and convolution on homogeneous and heterogeneous sized images of scenery, objects, and faces, indicate that: (1) threading reduces computation time by half of sequential operation for GPU; (2) tuned static load balanced GPU threading reduces computation time by up to a fourth
منابع مشابه
GPU Based Real-time Trinocular Stereovision
Stereovision has been applied in many fields including UGV (Unmanned Ground Vehicle) navigation and surgical robotics. Traditionally most stereovision applications are binocular which uses information from a horizontal 2-camera array to perform stereo matching and compute the depth image. Trinocular stereovision with a 3-camera array has been proved to provide higher accuracy in stereo matching...
متن کاملSoft GPGPUs for Embedded FPGAs: An Architectural Evaluation
We present a customizable soft architecture which allows for the execution of GPGPU code on an FPGA without the need to recompile the design. Issues related to scaling the overlay architecture to multiple GPGPU multiprocessors are considered along with application-class architectural optimizations. The overlay architecture is optimized for FPGA implementation to support efficient use of embedde...
متن کاملVideo-based 3D Reconstruction of Moving Scenes Using Multiple Stationary Cameras1)
In this paper, we describe a system for video-based 3D reconstruction of dynamic scenes using stereo techniques, with an eye to potential applications in human motion capture. We incorporate into our approach recent research results on stereo matching which make the system efficient and produce good-quality results. The implementation is built on top of Intel’s Open Source Computer Vision Libra...
متن کاملMulti-layer Clustering Topology Design in Densely Deployed Wireless Sensor Network using Evolutionary Algorithms
Due to the resource constraint and dynamic parameters, reducing energy consumption became the most important issues of wireless sensor networks topology design. All proposed hierarchy methods cluster a WSN in different cluster layers in one step of evolutionary algorithm usage with complicated parameters which may lead to reducing efficiency and performance. In fact, in WSNs topology, increasin...
متن کاملDrowsiness Detection for Drivers Using Computer Vision
Drowsiness detection system is regarded as an effective tool to reduce the number of road accidents. This project proposes a non-intrusive approach for detecting drowsiness in drivers, using Computer Vision. The algorithm is coded on OpenCV platform in Linux environment. The parameters considered to detect drowsiness are face and eye detection, blinking, eye closure and gaze. Input is captured ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017